A Single (Unified) Shader GPU Microarchitecture for Embedded Systems
نویسندگان
چکیده
We present and evaluate the TILA-rin GPU microarchitecture for embedded systems using the ATTILA GPU simulation framework. We use a trace from an execution of the Unreal Tournament 2004 PC game to evaluate and compare the performance of the proposed embedded GPU against a baseline GPU architecture for the PC. We evaluate the different elements that have been removed from the baseline GPU architecture to accommodate the architecture to the restricted power, bandwidth and area budgets of embedded systems. The unified shader architecture we present processes vertices, triangles and fragments in a single processing unit saving space and reducing hardware complexity. The proposed embedded GPU architecture sustains 20 frames per second on the selected UT 2004 trace.
منابع مشابه
GPU Performance Assessment with the HPEC Challenge
Commodity graphics processing units (GPUs) are highly parallel programmable microprocessors. The current high end GPUs offer a peak performance of 500 GFLOP/s in single precision. Previous methods of performing generalpurpose computation on GPUs required algorithm implementations be cast as 3D graphics operations with prohibitive limitations on programmable vertex and pixel shader length, contr...
متن کاملImplementation and Optimization of Image Processing Algorithms on Embedded GPU
In this paper, we analyze the key factors underlying the implementation, evaluation, and optimization of image processing and computer vision algorithms on embedded GPU using OpenGL ES 2.0 shader model. First, we present the characteristics of the embedded GPU and its inherent advantage when compared to embedded CPU. Additionally, we propose techniques to achieve increased performance with opti...
متن کاملGPGPU Based Particle System Simulation ∗
General purpose computing on graphics processing units, known as GPGPU but now often referred to as GPU computing, is the approach of performing computation on the GPU instead of the CPU. GPU computing has been made possible by the increasing programmability and performance of GPUs. The programmability of GPUs is accessed via shader programs, typically written in a C like language. Until recent...
متن کاملConsumer graphics cards for fast image processing based on the Pixel Shader 3.0 standard
Optical inspection and metrology for industrial applications often require fast image processing. Today this is mostly reached by special developed hardware, which represents a relevant cost factor in image processing systems. Furthermore, such special solutions often cannot be easily adapted to new requirements. Otherwise, CPU based systems feature a good programmability but often not enough p...
متن کاملReal-time subsurface scattering on the GPU
We present a GPU algorithm that computes subsurface light transport in real time on arbitrary animated meshes. We evaluate both single scattering and multiple scattering, by using piecewise linear and ring-based approximations of the surface in the fragment shader. We demonstrate our technique on animated meshes at 60 fps.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005